Published online 16 December 2012 



Nucleic Acids Research, 2013, Vol. 41, No. 3 1767-1782 

doi:10.1093/nar/gksl203 



Competition between the GlnR and PhoP regulators 
for the glnA and amtB promoters in Streptomyces 
coelicolor 

Alberto Sola-Landa 1 , Antonio Rodriguez-Garcfa 1 ' 2 , Rafat Amin 3 , Wolfgang Wohlleben 3 
and Juan F. Martin 1,2 '* 

1 INBIOTEC, Instituto de Biotecnologia de Leon, Avda. Real n°. 1, Parque Cientifico de Leon, 24006 Leon, 
2 Area de Microbiologia, Departamento de Biologia Molecular, Universidad de Leon, Campus de Vegazana s/n; 
24071 Leon, Spain and 3 lnterfakultares Institut fur Mikrobiologie und Infektionsmedizin, Mikrobiologie/ 
Biotechnologie, Eberhard-Karls-Universitat, Auf der Morgenstelle 28, 72076 Tubingen, Germany 

Received January 31, 2011; Revised October 26, 2012; Accepted October 30, 2012 



ABSTRACT 

Interaction of regulatory networks is a subject of 
great interest in systems biology of bacteria. 
Phosphate control of metabolism in Streptomyces 
is mediated by the two-component system PhoR- 
PhoP. Similarly, the utilization of different nitrogen 
sources is controlled by the regulator GlnR. 
Transcriptomic and biochemical analysis revealed 
that glnA (encoding a glutamine synthetase), glnR 
and other nitrogen metabolism genes are under 
PhoP control. DNA-binding experiments showed 
that PhoP binds to other nitrogen-regulated genes 
(SCO0255, SCO01863 and ureA). Using the glnA 
promoter as model, we observed that PhoP and 
GlnR compete for binding to the same promoter 
region, showing GlnR a higher affinity. Using a 
total of 14 GlnR-binding sites (50 direct repeat 
units) we established two information-based 
models that describe the GlnR box as consisting 
of two 11 -nt direct repeats each with clear differ- 
ences to PHO box. DNA-binding studies with differ- 
ent mutant sequences of glnA promoter revealed 
that the sequence recognized by GlnR is found in 
the coding strand whereas that recognized by 
PhoP is overlapping in the non-coding strand. In 
amtB promoter PhoP and GlnR boxes are not 
totally overlapping and both proteins bind simultan- 
eously. PhoP control of nitrogen metabolism genes 
helps to balance the cellular P/N equilibrium. 



INTRODUCTION 

Soil-dwelling actinomycetes produce a large array of bio- 
active secondary metabolites (1). Biosynthesis of these 
secondary metabolites is controlled by the availability 
of carbon, nitrogen and phosphate sources (2,3). 
Particularly relevant is the concentration in the culture 
medium of easily utilizable nitrogen sources (4) and phos- 
phate (5,6). Specific sets of genes, including those for 
secondary metabolite biosynthesis, are upregulated in 
response to phosphate deprivation (7,8) or ammonium 
limitation (4,9). 

Phosphate control of metabolism in several 
Streptomyces species is mediated by the two-component 
PhoR-PhoP system (10-12). Expression of genes belong- 
ing to the pho regulon in Streptomyces coelicolor is posi- 
tively regulated by binding of the phosphorylated response 
regulator PhoP (PhoP^P) to operators which contain two 
or more 1 1-nt direct repeat units (DRus). Two DRus form 
the so-called PHO box (13,14). It should be noted that 
protein-binding sites are defined by the conservation of 
the sequence of one strand, but proteins interact with 
both strands of the DNA. 

The number and organization of DRus has been 
determined in several PhoP-regulated promoters by 
directed mutagenesis of the nucleotides in the conserved 
positions of those DRus, by electrophoretic mobility shift 
assays (EMSA) and by footprinting assays (15,16). The 
regulatory mechanism involves cooperative binding of 
phosphorylated PhoP to the core (highly conserved) 
DRus followed by the binding of additional PhoP^P 
monomers to adjacent less conserved DRus (15). 
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Nitrogen sources and phosphate control microbial 
growth through interconnected networks. The overall 
nitrogen metabolism is regulated in S. coelicolor by mech- 
anisms that involve an apparent duplication of some 
structural and regulatory genes (4). A central role in 
nitrogen metabolism is played by the glutamine synthetase 
that assimilates ammonium into the cellular organic 
nitrogen. 

There are five genes in S. coelicolor which encode glu- 
tamine synthetase-like proteins (17), two of these proteins 
have been shown to possess glutamine synthetase activity: 
GlnA (glutamine synthetase I, (3-subtype) and Glnll (eu- 
karyotic type glutamine synthetase II) (18). Three genes 
amtB, glnK, glnD form an operon (9) encoding three 
proteins that play an important role in ammonium 
transport and metabolism. amtB encodes a putative 
ammonium transporter and glnK codes for the 
protein PII (19), a signal transmitter protein which — in 
Enterobacteriaceae — is involved in modulating the activa- 
tion/inactivation of the glutamine synthetase by 
adenylation/deadenylation (20). The third gene (glnD) in 
the amtB-glnK-glnD operon encodes an enzyme (PII 
nucleotidyl transferase) that modifies the protein PII 
post-translationally by adenylylation (19). In contrast to 
other bacteria, the adenylyltransferase GlnE which acti- 
vates/inactivates GlnA in response to the nitrogen avail- 
ability (21) is not controlled by the GlnD/PII system in 
S. coelicolor (19). 

Two regulatory genes glnR (22,23) and glnRII control 
expression of the nitrogen metabolism genes at the tran- 
scriptional level (9,24). GlnR is the main nitrogen regula- 
tor in S. coelicolor and binds to so-called 'GlnR-boxes' in 
the promoter regions of gin A, glnll, amtB and other 
nitrogen metabolism genes. The GlnR box has been 
proposed to be formed by 22 nt with a consensus 
sequence gTnAc-n 6 -GaAAc-n 6 (24). These sequences are 
formed by two direct repeats of different degree of con- 
servation (the so-called a- and b-sites). GlnRII, a second 
response regulator similar to GlnR, is encoded by an ORF 
located downstream of glnll (9), although the exact 
nucleotide sequence recognized by this protein has not 
been described. When compared to PhoP binding the 
mechanism of GlnR interaction with the GlnR-boxes is 
not well-known. 

Recently, we found that phosphate exerts a negative 
control over several genes involved in nitrogen metabol- 
ism in S. coelicolor (25,26). The response regulator (PhoP) 
of phosphate metabolism binds to the glnR promoter, 
encoding the major nitrogen regulator as shown by 
EMSA studies, but not to the glnRII promoter under iden- 
tical experimental conditions. PhoP also binds to the pro- 
moters of gin A, glnll and to the amtB-glnK-glnD operon. 
Thus, the negative control is both direct and indirect 
through the glnR promoter binding. In vivo expression 
studies using luxAB as reporter showed that PhoP 
represses the above mentioned nitrogen metabolism 
genes (25). 

Two-component regulatory systems consist of a sensor 
kinase and a cognate response regulator (27,28). PhoP and 
GlnR are related proteins belonging to the OmpR family 
of response regulators (29). PhoP is the response regulator 



of the two-component system PhoR-PhoP (10), whereas 
GlnR is an orphan response regulator (22). The response 
regulators have several residues that form the phosphor- 
ylation pocket near the N-end of the protein and an 
aspartic acid (the phosphorylated residue) at the middle 
of the N-terminal domain that defines the so-called 
'typical' response regulators (30). GlnR is classified as a 
'typical' orphan response regulator because it contains 
these motifs in its structure, but it is not linked to a 
cognate sensor kinase (29). Both PhoP and GlnR 
contain an HTH (helix-turn-helix) DNA-binding domain 
near their C-terminal ends (13,24). 

Taking into account the similarity of PhoP and GlnR, it 
was of great interest to clarify the relationship existing 
between these two global regulators in Streptomyces. The 
purpose of this work was to determine whether these two 
regulatory proteins were competing for interaction with 
sequences that could be overlapping in the glnA and 
other nitrogen-regulated promoters, or if there is a cross- 
regulation, i.e. if there is a control by GlnR over PhoP- 
regulated genes in addition to the previously described 
binding of PhoP to GlnR-regulated promoters (25). 

MATERIALS AND METHODS 

Regulatory proteins expression and purification 

The fusion protein GST-PhoP DBD was expressed in 
Escherichia coli DH5oc and purified in an AKTA-FPLC 
using a Glutathione Sepharose 4B column (GE 
Healthcare) as described previously (13). The 
GST-PhoP bBD protein was eluted with lOmM reduced 
glutathione (in 50 mM Tris-HCl, pH 8.0) and conserved 
in 40% glycerol at — 80° C before use. GlnR was expressed 
in E. coli with an N-terminal StrepII-tag and purified with 
StrepTactin Superflow gravity-flow columns (IB A), as 
described previously (24). Protein was stored at — 80°C 
in 50 mM Tris-HCl (pH 8.0), 100 mM NaCl and 10 mM 
P-mercaptoethanol. 

DNA manipulations 

Promoters of SCO0255 (from —315 to +32 with respect to 
translation start site), SCO 1863 (from -340 to +13), 
SC02195 (from -261 to +7) and ureA (from -278 to 
+ 19) for EMSA analyses were amplified by PCR and 
cloned into pGEM-T Easy (Promega). The phoRP, pstS, 
glnA, amtB and glnll promoters were obtained from 
pBS-PphoRP, pGEM-P^S (13), pGEM-PglnA, 
pGEM-PamtB and pGEM-Pglnll (25) plasmids, respect- 
ively. For analysis of the glnA promoter, different mutants 
were constructed with the QuickChange Site-Directed 
Mutagenesis Kit (Stratagene) using the pGEM-PglnA 
plasmid as template. The primers used in this work are 
listed in the Supplementary Table SI. The correct ampli- 
fication was tested by sequencing using an ABI PRISM 
3130 Genetic Analyzer (Applied Biosystems). 

Electrophoretic mobility shift assays 

DNA-protein interaction was tested by EMSA analysis. 
The promoters were excised from the corresponding 
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plasmids by endonuclease digestion and labelled at both 
ends with digoxigenin using the DIG Oligonucleotide 
3'-End Labeling Kit, 2nd Generation (Roche Applied 
Science). Binding reactions with GST-PhoP DBD protein 
were performed in 10 mM Tris-HCl (pH 8.0), 0.4 mM 
MgCl 2 , lOmM KC1, 0.2mM DTT, 1.6mM reduced gluta- 
thione, 0.01% Nonidet P40 and 13% glycerol (named 
buffer B in this work), for 30 min at 30°C (13). Binding 
reactions with GlnR protein were performed in 50 mM 
Tris-HCl (pH 8.0), 100 mM NaCl and 10 mM 
(3-mercaptoethanol (named buffer A), for 15 min at 
room temperature (24). Alternatively, competition experi- 
ments and binding reactions with mutated promoters were 
performed in 50 mM Tris-HCl (pH 8.0), lOOmM NaCl, 
0.4 mM (3-mercaptoethanol, 0.18mM reduced glutathione 
and 5.76% glycerol, for 20 min at room temperature, 
both for GST-PhoP DBD and GlnR proteins. In competi- 
tion experiments the two proteins were added simultan- 
eously to avoid a preferential binding of the probe for 
one of the proteins. In any case an excess of 50 (ig/ml of 
poly[d(I-C)] was added to every reaction mixture as 
internal control to avoid unspecific binding of protein 
to DNA. The samples were loaded onto a 5% polyacryl- 
amide native gel (14 cm x 16 cm) in 0.5X TBE buffer, 
running the electrophoresis at 80 V for 5h. DNA was 
electroblotted onto a nylon membrane in 0.5X TBE 
buffer (1 h, 200 mA), fixed by UV cross-linking and 
detected with anti-digoxigenin antibodies by chemilumini- 
scence with the CDP-Star™ reagent (Roche Applied 
Science). 

DNase I footprinting assays 

The sequences bound by the Strep-GlnR protein in the 
SCO0888 and SC07155 genes were determined by the 
non-radioactive DNase I footprinting assay described pre- 
viously (31,32). The promoter regions of both genes were 
obtained by PCR using primer pairs CAR87 and CAR88, 
and CAR89 and CAR90, respectively. PCR products 
were digested with Sphl-Mlul and inserted into the 
pGEM-5zf(+) vector (Promega) to yield pAR-N9 and 
pAR-Nll, respectively. Cloned sequences comprised the 
region of the SCO0888 gene from —141 to +11 (positions 
numbered with respect to the translation start site), and 
the region —264 to —13 of the SC07155 gene. Using these 
plasmids as templates, DNA probes were obtained by 
PCR using one 6-FAM-modified primer for labelling 
only one strand. Labelled and unlabelled primer pairs cor- 
respond to the T7 and SP6 promoter sequences of the 
pGEM-5zf(+) vector. The corresponding labelled 
primers were used also for sequencing with the Thermo 
Sequenase Primer Cycle Sequencing Kit (GE Healthcare). 

Reaction mixtures contained 9.3 nM of labelled DNA 
probes. Two reaction buffers were assayed yielding the 
same results. One buffer is based on the one used by 
Tiffert et al. (24) and it is characterized by the high salt 
concentration: 50 mM Tris-HCl pH 8, 100 mM NaCl, 6% 
glycerol, 0.4 mM MgCl 2 , 1.0 mM DTT and 50ug/ml 
poly[d(I-C)]. The other had the usual composition of foot- 
printing buffers: 50 mM Tris-HCl pH 8, 50 mM KC1, 6% 
glycerol, 1.0 mM MgCl 2 , 0.2 mM DTT and 10u£/ml 
poly[d(I-C)]. Control reactions without protein were 



supplemented with the protein solution buffer. After 
binding of the Strep-GlnR protein to DNA (30°C, 
20 min), DNase I (Roche) digestion was carried out 
during 1 min at 30°C (from 0.5 10 _1 to 2.0 10 _1 units/ml). 

Reactions were phenolized, ethanol precipitated and 
loaded into an ABI PRISM 3130 sequencer together 
with the molecular standard Gene-Scan® 500 LIZ™ 
(Applied Biosystems). Electropherograms were analysed 
with PeakScanner vl.O software (Applied Biosystems) to 
determine the protected sequence. 

The binding sites of PhoP in the SCO0255, SCO 1863 
and ureA promoters were determined by the footprinting 
procedure described above. The plasmids that contained 
the promoters cloned in pGEM-T Easy were used as PCR 
templates for labelling. The reaction conditions were 
those of Santos-Beneit et al. (32) and 4uM of the 
GST-PhoP DBD protein. 

'Information theory' analysis of binding sites 

The Delila package programs (makebk, encode, rseq, 
dalvec, makelogo, ri and lister) were used to calculate 
the information content (R{ value) of individual sequences 
(33), and to obtain logos and walkers for the analysis of 
binding sites (34,35). To calculate the information content 
a weight matrix from the frequencies of each nucleotide at 
each position of the aligned sequences is generated and 
applied to the sequences themselves to determine the con- 
servation of each individual sequence (33). The logo is a 
representation made by the alignment of different se- 
quences where the height of each letter is proportional 
to the frequency of that base in the alignment, and the 
height of the letter stack is the conservation in bits at 
that position (33). The positive or negative height of 
each letter in a walker shows the contribution of that 
base to the average sequence conservation of the binding 
site (35). Promoter sequences were scanned for binding 
sites using the RSA tools server (36) and information 
score matrices. 

RESULTS 

PhoP binds to the promoter region of several additional 
genes of nitrogen metabolism 

Since we have shown in a previous article that PhoP binds 
to the promoters of glnR, glnA, glnll and amtB (25), we 
decided to analyse all the promoters described to be 
regulated by GlnR (24) looking for the presence of PHO 
boxes using the matrix model version 2 of PhoP operator. 
Sola-Landa et al. (15) described the structure of PhoP 
operators (named model 1 operator). This model relies 
in an information content matrix calculated from the 
alignment of 11 -nt sequences using the Delila programs 
(33). These sequences are DRus that form the PhoP- 
binding sites. In this work, we updated the PhoP- 
binding site model (named model 2 operator) taking into 
account the sequences of the SC02878 operator (15), and 
the recently experimentally confirmed operators found in 
the afsS promoter (32), in the glnR, glnA, glnll, amtB 
promoter regions (25), and those in the glpQl and glpQ2 
promoters (37). As with model 1, model 2 matrix included 
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only type C (core) DRus (54 DRUs in model 2, 17 more 
than in model 1). We found four new N metabolism can- 
didates to be regulated by PhoP namely: SCO0255 (coding 
for a putative transcriptional regulator), SCO 1863 (a 
hypothetical protein), SC02195 (another hypothetical 
protein) and ureA (encoding the urease gamma subunit). 
These promoters were analysed by EMSA, obtaining a 
positive PhoP binding with SCO0255, SCO 1863 and 
ureA, although in this last case the binding was detected 
only with a high concentration of protein (Figure 1 A). The 
PhoP-binding sites in these promoters were determined by 
footprinting (see Supplementary Figure SI). The SCO0255 
promoter was protected from DNase I digestion by the 
GST-PhoP DBD protein on 80 nt in the coding strand and 
79 on the complementary strand. This large protected 
region contains 7 DRus of 1 1 nt with different degree of 
conservation (located between positions —183 to —108 
from the start codon). These consecutive sequences 
which form the PhoP-binding sites are located on the 
non-coding strand. The four first DRus overlap with the 
two putative GlnR boxes which are located in the coding 
strand (24) (Figure IB and Supplementary Figure SI A). 

The SCO 1863 promoter was protected in both strands in 
two large regions (Supplementary Figure SIB). The first 
region was clearly protected, as shown by the lowered 
electropherogram peaks, and comprised 2 DRus in the 
coding strand (positions —138 to —117), and 2 DRus in 
the non-coding strand (positions —116 to —94) 
(Figure IB). This configuration of two opposite PHO 
boxes has not been described previously. Interestingly, 
the two putative GlnR boxes located previously (24) 
overlap this region (Figure IB). A second region of the 
SCO 1863 promoter contained several protected peaks 
intertwined with some unprotected ones across 70 base 
pairs (—304 to —235) (Supplementary Figure SIC). The 
bioinformatics analysis of this sequence using PhoP 
model 2 revealed only two separated DRus with positive 
information content (Figure IB). These results can be ex- 
plained taking into account the EMSA results. As shown in 
Figure 1A, at lower protein concentrations one DNA- 
protein complex of faster migration is formed (lane 0,25); 
but higher protein concentrations result in the DNA- 
protein complex of slower migration only (lane 1). It is 
proposed that the first complex is due to the occupancy 
of the PHO boxes identified in the main protected region 
that would act as the site core; higher protein concentra- 
tions allow PhoP to bind also the second region of low 
conservation in a cooperative fashion. The promoter of 
the urease gamma subunit gene, ureA, was poorly 
protected from DNase I digestion (Supplementary 
Figure SID), what is in agreement with the need of high 
protein concentrations to obtain gel retardation 
(Figure 1A). The protected sequences were located at pos- 
itions — 1 86 to — 1 62 in the coding strand, and from — 1 50 to 
— 190 in the complementary strand. There are two DRus in 
the coding strand sequence of low information content 
values (0 and 6 bits, respectively). These low values 
account for the poor PhoP-binding affinity. As occurs 
with the other promoters, the proposed GlnR-binding 
site [two boxes located from —194 to —151; see (24)] 
overlaps the PhoP-binding site of this gene (Figure IB). 



GlnR does not bind to PhoP-dependent promoters 

Additionally, it was of interest to study if there is a recip- 
rocal cross-regulation of both response regulators, i.e. if 
GlnR binds to well-known PhoP-dependent promoters. 
The possible binding of GlnR to the promoters of 
phoRP and pstS (encoding the high affinity phosphate 
transporter) was tested using EMSA. Results 
(Supplementary Figure S2) showed that PhoP binds and 
forms two DNA-PhoP complexes (arrows) with both the 
phoRP (Supplementary Figure S2, lanes 2-3) and pstS 
(lanes 7-8) promoters. However, purified GlnR did not 
form any discrete retarded bands (Supplementary 
Figure S2, lanes 4-5 and 9-10), even at high protein con- 
centration (the gel in Supplementary Figure S2 was 
overexposed to detect any possible DNA-protein 
complex). 

These results indicate that there is no affinity of GlnR 
for the PhoP-dependent promoters and it is consistent 
with the lack of authentic GlnR boxes in these promoters. 
Interestingly, the opposite reaction, i.e. binding of PhoP to 
several promoters of nitrogen metabolism genes occurs 
with high affinity (25). 

PhoP and GlnR compete for binding to the glnA promoter 

In previous experiments GlnR binding has been made in 
buffer A (24), and PhoP binding in buffer B (13) (see com- 
position in Materials and Methods section). In order to 
perform competition experiments, binding of both PhoP 
and GlnR to glnA, glnll and amtB promoters was tested in 
both buffers A and B; a better binding was observed in the 
three cases in buffer A (data not shown). A slightly 
modified buffer A, hereafter named buffer MA (for 
modified A), was thereafter used, containing a reduced 
concentration (0.4 mM) of (3-mercaptoethanol (instead of 
lOmM in the normal buffer A), a small amount of 
reduced glutathione (0.18mM) and glycerol (5.76%), 
both coming from the GST-PhoP DBD stock solution. 

As shown in Supplementary Figure S3, both DNA- 
binding proteins (PhoP and GlnR) formed discrete 
complexes with the glnAp probe in buffer MA. At low 
protein concentrations, the glnAp-PhoP and glnAp-G\nR 
complexes can be easily separated due to their different 
molecular weight. The GST-PhoP DBD (39kDa) fusion 
protein is larger than GlnR with the Strep-tag (29kDa). 
At increasing concentrations, GlnR showed higher 
affinity than PhoP for the glnA promoter and retards com- 
pletely the probe. Moreover, the probe is shifted to a 
second complex (incorporating other two GlnR molecules, 
see Discussion section) that migrates very close to that 
formed by PhoP with this promoter (lanes 4-5). 

To clarify the possible competition or cooperation 
between both regulatory proteins a new experiment was 
made using reduced GlnR concentrations (4-fold dilution 
with respect to the previous experiment) to avoid the for- 
mation of this second shifted band. With the reduced 
protein concentrations used in the new experiment, 
only a shifted band was detected with GlnR (Figure 2A, 
lanes 2-5), and two different bands (I and II) were 
detected with the highest concentration of PhoP (lane 9), 
as described previously (25). When PhoP was mixed with 
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Figure 1. (A) Analysis by EMSA of binding of PhoP to different promoters regulated by GlnR. P, probe without protein. Numbers indicate the 
GST-PhoP DBD concentration (|iM). The conditions of binding reaction were the usual for PhoP (13). (B) Summary of the DNase I footprints of 
SCO0255, SCO 1863 and ureA genes with GST-PhoP DBD protein. The protected nucleotides are indicated by asterisks and the hypersensitive sites 
created by protein binding are highlighted by vertical arrows. Coordinates are relative to the translation start codons. The 11 -nt direct repeats that 
form the PhoP-binding sites are indicated by grey arrows; the direct repeats that constitute the GlnR boxes according to Tiffert et al. (24) are 
indicated by black arrows. The Ri values above each arrow (information content) of the PhoP repeats were calculated using model 2. 
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PhoP 
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GATCCGACCCCTCCC |^GGCCAATTG|AAGACGCTTTG| r [lTACCCAGTGC| 3GGCTCTTTAGTGGGCAGGGATCCCAGCTCCTT 

Figure 2. (A) Competition between GlnR and PhoP to bind to the glnA promoter. Lane 1, probe without proteins; lanes 2-5, increasing concen- 
trations of GlnR protein (from 0.03125 fiM to 0.25 fiM); lanes 6-9, increasing concentrations of GST-PhoP DBD protein (from 0.125 |iM to 1 \M); 
lanes 10-13, increasing concentrations of GlnR protein (from 0.03125 fiM to 0.25 fiM) mixed with GST-PhoP DBD protein at a constant concentration 
of 1 fiM. Shifted bands are indicated by arrows. Complexes glnAp-PhoF I and II refer to those described previously (25). (B) Overview of the glnA 
promoter showing the PhoP DRus (boxed and shadowed) and GlnR-boxes (boxed). The protected regions by PhoP of the upper and bottom strands 
are indicated by solid lines (25). The —10 box and the transcription start point (tsp) according to Fisher and Wray (38) are also shown. 



GlnR and added simultaneously to the DNA probe there 
was a competition between both proteins and both types 
of complexes glnAp-GXnR and glnAp-PhoP were seen 
(Figure 2, lane 10). When increasing concentrations of 
GlnR were used in the presence of PhoP, the largest 
PhoP-DNA complex (glnAp-PhoP II) was not formed 
(lane 13) and the intensity of the small complex (glnAp- 
PhoP I) clearly decreased. The glnAp-G\nR complex 
became the predominant one. No new supershift 
complex corresponding to the binding of both proteins 
at the same time was detected, indicating that both 
proteins compete for the promoter and only one can be 
bound in a certain moment. 

PhoP and GlnR bind to non-overlapping sequences in the 
amtB promoter 

In a similar way a competition experiment was performed 
with amtB promoter. With increasing concentrations of 
GlnR only one shifted band can be detected (Figure 3A, 
lanes 2-5) and two bands are obtained with PhoP 
(Figure 3, lanes 6-9). When both proteins were present 
at high concentrations a new different (supershifted) 
band was detected (lane 13). Unlike the glnA promoter, 
where GlnR- and PHO boxes are mostly overlapping 
(Figure 2B), in the amtB promoter the different boxes 



are adjacent (Figure 3B) and both proteins can be 
bound simultaneously to the promoter. 

Models of the GlnR and PhoP-binding sites and 
footprinting validation of the GlnR model operator 

Models of the protein-binding sites on DNA help to char- 
acterize known operators and to predict new binding sites. 
According to the model of Sola-Landa et al. (15) the 
simplest PhoP operator is formed by two consecutive 
DRus (also called a PHO box). Each protein monomer 
binds a DRu; two or three conserved DRus are bound 
simultaneously and form the core of the site (type C 
DRu). Following the core occupancy, other protein 
monomers can bind to adjacent DRus that are poorly 
conserved (type E, for extension, DRu). As indicated 
above in this article, we introduced the matrix model 2 
and its logo is shown in Figure 4A. The sequences of the 
new nitrogen metabolism promoters regulated by PhoP 
described in this work (SCO0255, SC01863 and ureA) 
were not included in model 2 because it was not possible 
to define which DRus form the core of the binding sites 
(Supplementary Data SI). 

Tiffert et al. (24) determined the binding of the GlnR 
protein to the upstream regions of 13 genes by EMSA. 
Using the alignment of the coding-strand sequences, 
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Figure 3. (A) Competition between GlnR and PhoP to bind to the amtB promoter. Lane 1, probe without proteins; lanes 2-5, increasing concen- 
trations of GlnR protein (from 0.25 jiM to 2jiM); lanes 6-9, increasing concentrations of GST-PhoP DBD protein (from 0.25 jiM to 2lxM); lanes 10- 
13, increasing concentrations of GlnR protein (from 0.25 jiM to 2jiM) mixed with GST-PhoP DBD protein at a constant concentration of 2fiM. 
Shifted bands are indicated by arrows. (B) Overview of the amtB promoter showing the PhoP DRus (boxed and shadowed) and GlnR-boxes (boxed). 
The protected regions by PhoP of the upper and bottom strands are indicated by solid lines (25). The PhoP and GlnR DRus have been numbered for 
an easier explanation of the text. The tsp PI previously described (9) is shown. 



a GlnR box was defined as composed of two partially 
conserved direct repeats of 1 1 nt. All the binding sites 
appeared to contain two consecutive GlnR boxes. 
Positive retardation in EMSA assays of a synthetic 22-nt 
fragment (corresponding to the sequence of the glnA 
promoter) proved that a single box (22 nt) can be bound 
by GlnR. Footprinting assays confirmed protection 
of two GlnR boxes in the glnA- and nirB-hmdmg sites, 
but only one box was protected in the gdhA upstream 
sequence (24). 

We created an initial information content matrix of the 
GlnR-binding site using the 26 GlnR boxes (two from 
each promoter region) proposed by Tiffert et al. (24). 
The matrix scan on the 13 promoter regions recognized 
by GlnR identified the same reported boxes except in 
the gdhA, SCO0888 (coding for a putative NADPH- 
dependent FMN reductase) and SC07155 (a hypothetical 
protein) upstream sequences. In the case of gdhA, the 
matrix yielded a positive value of information content 
(Ri value) for the DNase I-protected box, and a negative 
value for the adjacent sequence. Within the accuracy of 
the matrix, it has been demonstrated that the R[ value of 
true binding sites should be positive (35). 

In the SC07155 sequence only one of the two reported 
GlnR boxes showed a positive R{ value. Instead of the 
GlnR boxes previously proposed in the coding strand 
of the SCO0888 promoter (24), the matrix identified 
a putative GlnR box in the non-coding strand 
(Figure 4B). We decided to confirm both predictions 
with footprinting assays. The promoter regions of 
SCO0888 and SC07155 genes were cloned as detailed in 
Materials and Methods section. Coding and non-coding 



strands were separately labelled to determine the protected 
nucleotides from DNase I digestion in each strand 
(Figure 5). When comparing the electropherograms of 
control reactions with those of the fragments with the 
GlnR protein bound, it was evident that the Strep-GlnR 
protein, in addition to protecting the binding sequence, 
altered the pattern of peaks flanking the binding site; 
this alteration is a well-known phenomenon and indicates 
a spatial rearrangement of the DNA chain. The accumu- 
lations of protected nucleotides served to locate the region 
corresponding to the binding site. The GlnR protected 
region in the SCO0888 promoter was located between nu- 
cleotides —35 and —63 in the non-coding strand 
(Figure 5A), and between —58 and —34 in the coding 
strand (Figure 5B). This region comprised the predicted 
GlnR box, located at positions —39 to —60 (Figure 5E). 
In the SC07155 gene, almost all the nucleotides in the 
coding strand between —183 and —154 were protected 
(Figure 5C). The opposite strand was protected at 
several positions between —184 and —153 (Figure 5D). 
Of the four direct repeats in this promoter that form the 
GlnR boxes proposed by Tiffert et al. (24) the first direct 
repeat was not protected by GlnR (Figure 5E). In fact, 
the information content of this repeat was negative (see 
below). 

Taking into account the results of these footprinting 
experiments and those of Wang and Zhao (39), who 
have recently reported a GlnR-binding site in the nasA 
promoter, we built two new information models named 
GlnR/22 and GlnR/ 11 to analyse the GlnR-binding sites 
by means of sequence logos and walkers (see Materials 
and Methods section). The models were built from the 
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Figure 4. (A) Sequence logos of the model GlnR/11 and of the PhoP 
model 2. The sine wave represents the accessibility of a face of the 
DNA (B-form, 11 bases of helical pitch). The height of each letter is 
proportional to the frequency of that base in the aligned sequences used 
to build the model, and the height of the letter stack is the conservation 
in bits at that position (33). Error bars are shown at the top of the 
stacks. (B) Sequence logo of the model GlnR/22 and alignment of the 
25 sequences that form the GlnR boxes included in this model. When 
two boxes are present, they are consecutive except in nets A. In this 
promoter the second box, i.e. the 3rd and 4th repeats, is separated 
by 1 nt from the first one. The clearly conserved positions are indicated 
by a grey shade. 



set of GlnR-binding sequences shown in Figure 4B. Two 
GlnR boxes for each gene were included except for the 
gdhA, SCO0888 and SC07155 upstream sequences, 
where, as indicated, only one GlnR box was identified. 



The model GlnR/22 was created from the full sequences 
of 22 nt that form the GlnR boxes. This model best re- 
flected the higher conservation of the second repeat in 
each GlnR box (Figure 4B), which corresponds to the 
so-called 'b-site' identified by Tiffert et al. (24). 

In contrast, model GlnR/11 comprised the same set of 
sequences, but decomposed in 11 -nt repeats (Figure 4B). 
This last model served to calculate separately the informa- 
tion content of each 1 1-nt repeat, e.g. the first repeat in the 
SC07155 GlnR-binding site. This 11-nt repeat had a 
negative Ri value and was not protected, whereas the 
second one showed a clear positive value (Figure 5E). 
These results indicate that GlnR can bind three consecu- 
tive direct repeats of 11-nt each. 

Dissection of the GlnR/PhoP recognition sequences by 
directed mutagenesis 

In order to determine whether PhoP and GlnR could bind 
to the same region (although recognizing different se- 
quences) and to discriminate two putative PHO sites 
located in the sense or in the antisense strands of the 
glnA promoter, nine new promoters containing point mu- 
tations named Ml, M2, M5, M7, M8, M9, M10, M(l + 5) 
and M(l + 8) (the last two contain double mutations) were 
constructed by directed mutagenesis of the glnA promoter 
and used for EMSA analyses. The PhoP information 
matrix 2 and the GlnR/11 model were used to calculate 
the Ri values of wild-type and mutant sequences, and also 
to create sequence walkers as described previously (25). 
The binding of PhoP and GlnR to those new promoters 
was compared to that of the original glnA promoter. 

The previous footprinting assays using the GST- 
PhoP DB protein revealed a protected region in the glnA 
promoter of 29 nt and 35 nt in the coding and non-coding 
strands, respectively. The structure of this binding site 
showed two possible interpretations. The site might be 
composed of three consecutive core DRus (a CCC struc- 
ture) located in the coding strand or, alternatively, 
composed of two consecutive core DRus and an 
upstream extension DRu separated by 1 nt (a E[1]CC struc- 
ture) located in the non-coding strand (25). The R{ values of 
the DRu of the CCC structure are 3.0,-11.0 and 9.0 bits, 
respectively. The high negative value of the second DRu 
would hamper the formation of stable PhoP-glnAp 
complexes suggesting that the actual structure is that of 
the non-coding strand (E[1]CC), with R { values of 0.5, 
—2.8 and 8.3 bits, respectively (Figure 6). To determine 
the actual binding site structure the nucleotides 
GGTCAC of the third DRu located in the coding strand 
(positions 3 and 4 of the 11-nt DRu) were changed to 
GGGAAC (new promoter glnA-Mlp). If the actual struc- 
ture is the sense strand CCC, this change should decrease 
the affinity for PhoP (from 9.0 of the wild- type sequence to 
—0.1 of the M7p sequence) but analysis by EMSA showed 
a higher affinity of M7p for PhoP than that of the wild type 
(as shown by the strong retardation of the labelled probe 
resulting in the formation of two retarded bands) 
(Figure 6). However, interestingly the M7 mutation leads 
to the creation in the non-coding strand of a new first DRu 
without any separation from the others with a higher R^ 
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Figure 5. DNase I footprints of the Strep-GlnR protein bound to the promoter regions of SCO0888 [(A), non-coding strand; (B), coding strand], and 
SC07155 [(C), coding strand; (D), non-coding strand]. The same protection patterns were obtained with protein concentrations ranging from 2jiM to 
8 fiM. In each panel, the upper electropherogram is the control reaction without protein. The protected nucleotides are indicated by asterisks and the 
peak shadowed areas. Hypersensitive sites created by the protein binding, located on the sides of the protected sequence, are highlighted by vertical 
arrows. The correspondence between fluorescence peaks and nucleotide bases was determined using sequencing reactions. (E) Summary of the 
protection results. Coordinates are relative to the translation start codons. The 11 -nt direct repeats that form the GlnR boxes are indicated by 
horizontal solid arrows over the non-coding sequence of SCO0888 and over the coding sequence of SC07155. The Ri value of the repeat, calculated 
using model GlnR/ 11 is shown above each arrow (see text). The first arrow over the SC07155 sequence is open to indicate that the repeat, which was 
previously proposed as part of the binding site (24), is not protected. 
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Figure 5. Continued. 



value (3.0 bits) and without penalty by the separation (15). 
These positive changes in the R{ value agreed with the 
observed increased binding of PhoP to the M7 promoter 
and indicate that the structure should be defined in the 
non-coding strand. This was confirmed with the other mu- 
tations (a summary of the different mutations showing the 
values obtained for the new DRus and the results obtained 



by EMSA is depicted in Figure 6). For example, conversion 
of GTTAAC in the third DRu of the non-coding strand 
(with a value of 8.3 bits) to GCGAAC (-2.6 bits), leads, 
as expected, to a total lack of binding of PhoP to the 
new promoter glnA-M5p. These results confirmed that 
the PhoP-binding site structure (E[1]CC) is defined by the 
DRus present in the complementary strand of gin A. 
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Analysing all the results we observed that negative 
mutations in the second or third DRu drastically 
reduced the binding of PhoP [promoters M2, M5, M8 
and M(l + 5)], supporting the conclusion that these two 
DRus form the core of the PhoP-binding site. Moreover, 
the modifications in the first DRu show the importance of 
the separation of 1 nt between this DRu and the core. 
Thus, a mutation increasing the R{ value (from 0.5 to 
5.5 bits in the Ml 0 promoter) but keeping the 1-nt separ- 
ation, barely affects the affinity of PhoP in EMSA studies, 
whereas modifications resulting in a new DRu with a 
small R[ increase (to 3.0 bits in M7p) or even a slightly 
negative value (—0.8 in Mlp) but without any gap 
(nucleotide separation) clearly increased the binding of 
PhoP (Figure 6). 

Changes in GlnR interaction in the mutant glnA 
promoters confirm the DRu of this promoter 

Similarly, the same promoters were analysed by EMSA 
with the Strep-GlnR protein (Figure 7). The GlnR 
operator of glnA is composed of two consecutive GlnR 
boxes (24). Our results indicated that the most important 
GlnR box is the first one (in the sense strand of glnA), and 
that this is the first sequence bound by GlnR. First, it was 
in this box that the negative mutations had a more drastic 
effect [M2, M5, M9 and M(l + 5)]. Indeed, in cases where 
mutated DRu had negative R x values (M5 and M9 pro- 
moters) these mutations lead to a complete loss of GlnR 
binding. Second, negative mutations in the first repeat of 
the second GlnR box that showed moderate decreases of 
the R { values did not affect the GlnR binding (M10 
promoter) or even resulted in an apparent higher affinity 
(M7 promoter). Third, the most significant result was 
obtained with the Ml promoter, detecting two shifted 
bands with the diluted protein concentration used (two 
bands can be detected with the wild-type promoter, but 
at higher concentrations; see Supplementary Figure S3). 
The unretained probe Ml decreased clearly as compared 
to the wild-type promoter (top left EMSA in Figure 7). 
These results suggest that the Ml mutation increased the 
GlnR affinity. The second band formed by GlnR in the 



Ml promoter can also be seen in the double mutant 
M(l +8). In summary, the mutation in the Ml promoter 
enhanced binding of GlnR whereas it affects moderately 
the binding of PhoP. 

DISCUSSION 

GlnR binds to the glnA promoter with higher 
affinity than PhoP 

From the initial experiments described in this article it is 
clear that both GlnR and PhoP bind to the glnA promoter, 
used in this work as a model of nitrogen-regulated pro- 
moters. Both DNA strands are protected in DNasel foot- 
printing experiments. The nucleotide sequences recognized 
by both proteins are overlapping, although in different 
strands, as shown in the present study. When both regula- 
tory proteins were mixed the glnA promoter interacted 
preferentially with GlnR (Figure 2). When low concentra- 
tions of GlnR were used in presence of large concentrations 
of PhoP (up to 32-fold higher concentration of PhoP), it 
was clearly observed that GlnR showed a higher affinity 
than PhoP for the glnA promoter and the G\nR-glnAp 
complex was mostly predominant. 

This preference of the glnA promoter by GlnR is logic 
because GlnA is clearly related with the nitrogen assimi- 
lation and, therefore, GlnR is its main regulator (22). In 
addition, the presence of GlnR is required for glnA ex- 
pression since a deletion mutant in the glnR gene is glu- 
tamine auxotrophic (22). However, PhoP is able to bind 
the glnA promoter even in the presence of low concentra- 
tions of GlnR, affecting its expression. This negative effect 
of PhoP has been shown in vivo previously by microarray 
studies using a AphoP mutant. In this mutant the expres- 
sion of glnA and other nitrogen-related genes is higher 
than in the wild-type strain in phosphate-limiting condi- 
tions (7). In the case of glnA the negative role of PhoP 
over its expression is by direct competition with GlnR, 
since the sequences recognized by these proteins are 
overlapping (Figure 2) and only one of the two regulatory 
proteins can be bound to the promoter in a certain 
moment. 
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Figure 6. Sequence walker analysis of the PhoP-binding sites in the non-coding strand of gin A promoters. Sequence walkers (35) serve, for individual 
sequences, to show the contribution of each base to the conservation of the DRu. The full-length sequence walker (top) corresponds to the wild-type 
promoter; only the modified DRus are shown for the mutants, with the changed nucleotides over the walker. Analyses by EMSA of the promoters 
are shown in both sides. P, probe without protein; 1-4, increasing concentrations of GST-PhoP DBD protein (from 0.125 jiM to 1 |iM). For M2, M5 
and M(l + 5) promoters only the highest concentration (1 jjM) is shown. 
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Figure 7. Sequence walker analysis of the GlnR-binding sites in the coding strand of gin A promoters. The full-length sequence walker (top) 
corresponds to the wild-type promoter; only the modified repetitions are shown for the mutants, with the changed nucleotides over the walkers. 
Analyses by EMSA of the promoters are shown in both sides. P, probe without protein; 1-4, increasing concentrations of GlnR protein (from 
0.03125 uM to 0.25 uM). For M5, M9 and M(l + 5) promoters only the highest concentration (0.25 uM) is shown. 



1780 Nucleic Acids Research, 2013, Vol. 41, No. 3 



A different situation has been found in the amtB 
promoter that is also regulated by PhoP and GlnR. In 
this case, the PHO box and the GlnR box do not 
overlap and the binding of both proteins occurs resulting 
in a large DNA-PhoP-GlnR complex observed in the 
supershift (Figure 3). Although PHO and GlnR boxes 
are not overlapping they are very close and the binding 
of PhoP can disturb the correct binding of GlnR. In fact, 
footprinting analysis showed that the protected region 
by PhoP extends the limits of the boxes [Figure 3B, 
according to (25)]. This extended protection is usual 
with most proteins since they cover a stretch of DNA 
on both sides of the core recognition sequences, and 
it also happens with GlnR (Figure 5C). The negative 
effect of PhoP over amtB expression was supported 
by in vivo experiments using a AphoP mutant (7) and is 
explained by a hindrance effect resulting from PhoP 
binding to DRu-3 that disturbs GlnR binding to GlnR-1 
(Figure 3B). 

New insights into the GlnR box 

Based on 13 promoters recognized by GlnR (24), we have 
built two information-based models to describe the GlnR 
operators that were corroborated by new footprinting 
results (Figures 4 and 5). The model GlnR/22 was built 
from 22-nt sequences (GlnR boxes) and reflects the higher 
conservation of the second 11 -nt repeat (Figure 4B). In a 
recent article on GlnR of Streptomyces venezuelae pub- 
lished during the elaboration of the present article the 
authors describe a GlnR box of 16nt (40) corresponding 
to the 1 1 nt of the a-site and the 5 well-conserved nucleo- 
tides of the b-site. This model does not differ so much of 
our model GlnR/22. Tiffert et al. (24) proposed that the 
GlnR protein binds the complete box as a dimer. The 
GlnR-binding site of SC07155, however, is composed of 
three direct repeats of lint, instead of only two. Recent 
protection analyses and directed mutagenesis studies of 
PhoP in S. coelicolor (15) and the structure of the 
DNA-binding domain of the E. coli homologous protein 
(41) show that the binding sites of these response regula- 
tors are composed of two, three or more DRus of 11 -nt 
each (corresponding to a full turn of DNA double helix in 
the B-configuration). 

The alignment of the sequences of the 13 promoters 
recognized by GlnR (24) suggested that two consecutive 
GlnR boxes were always present in the coding strand of 
the GlnR-regulated promoters. In contrast, the footprint- 
ing results of the gdhA gene (24), and of SCO0888 (this 
work) indicate that a single GlnR box can constitute an 
operator. Moreover, the GlnR box of SCO0888 was 
found on the promoter non-coding strand. The rule that 
proposes that when two boxes are present, they are struc- 
turally consecutive does not hold strictly true in the binding 
site of the nasA promoter. Although it was reported that 
GlnR binds to non-consensus sequences in the nasA 
promoter (39) our analysis revealed the presence of GlnR 
box. The protected sequences reported by Wang and 
Zhao (39) clearly contain two boxes separated by 1 nt 
(Figure 4B). These observations indicate that the GlnR- 
binding features are more flexible than previously expected. 



Biological significance of global regulators interaction 

There is increasing evidence of genes regulated by more 
than one regulatory protein (42). We reported recently 
that both AfsR and PhoP bind and compete in the regu- 
lation of the afsS promoter in S. coelicolor (32). In this 
case, there is a reciprocal cross-talk of both regulators, 
since AfsR also shows affinity for several other phos- 
phate-regulated promoters, unlike GlnR. 

The binding of two different regulators to the same 
promoter region, allows the cell to modulate the expres- 
sion of the target gene in response to two different stress 
signals, namely phosphate and nitrogen limitation. 

This work provides an interesting example of non- 
reciprocal regulation of the GlnR-regulated gin A model 
promoter by the master PhoP protein. PhoP binds 
directly to the promoter of at least seven nitrogen 
metabolism-related genes (25; this work), whereas GlnR 
does not bind to the main phosphate regulon genes. These 
findings open the way for a better understanding of inter- 
actions between major regulation networks, what will con- 
tribute to the progress in the systems biology of 
actinomycetes (42). 

ADDED IN PROOF 

When this article was under final revision a report was 
published on the regulation of amtB promoter 
(Wang et al, 2012; J. Bacteriol., 194, 5237-5244) that 
supports our results on the interaction of PhoP and 
GlnR on the regulation of this promoter. 

SUPPLEMENTARY DATA 

Supplementary Data are available at NAR Online: 
Supplementary Table 1 and Supplementary Figures 1-3. 
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